System, device, and method for time-domain equalizer training using a two-pass auto-regressive moving average model

ABSTRACT

A system, device, and method for time-domain equalizer training uses a two-pass auto-regressive moving average model. A communication channel is first modeled using p 1  poles and q 1  zeros to form a first shortened channel impulse response having a first approximation H 1 (z)=B 1 (z)/1+A 1 (z), wherein q 1 , is greater than a predetermined cyclic prefix length. A time-mirrored image of the first shortened channel impulse response is then formed, and the resulting time-mirrored image of the first shortened channel impulse response is modeled using p 2  poles and q 2  zeros to form a second shortened channel impulse response having a second approximation H 2 (z)=B 2 (z)/1+A 2 (z), wherein q 2  is less than or equal to the predetermined cyclic prefix length. The time-domain equalizer coefficients are determined by combining A 1 (z) and A 2 (1/z) with an appropriate amount of delay.

CROSS-REFERENCE TO RELATED APPLICATIONS

The present application may be related to the following commonly-owned United States patent applications, which are hereby incorporated by reference in their entities:

U.S. patent application Ser. No. 09/542,212 entitled SYSTEM, DEVICE, AND METHOD FOR TIME-DOMAIN EQUILIZER TRAINING USING AN AUTO-REGRESSIVE MOVING AVERAGE MODEL, fled on even date herewith in the names of Qingli Liu and Aleksandar Purkovic;

U.S. patent application Ser. No. 09/542,336 entitled SYSTEM, DEVICE, AND METHOD FOR TIME-DOMAIN EQUILIZER TRAINING USING INJECTED OUT-OF-BAND NOISE, filed on even date herewith in the names of Aleksandar Purkovic and Qingli Liu; and

U.S. patent application Ser. No. 09/547,229 entitled SYSTEM, DEVICE, AND METHOD FOR DETERMINING THE SAMPLING TIME FOR A TIME DOMAIN EQUALIZER, filed on even date herewith in the names of Qingli Liu and Aleksandar Purkovic.

FIELD OF THE INVENTION

The present invention relates generally to communication systems, and more particularly to time-domain equalization of a communication channel.

BACKGROUND OF THE INVENTION

FIG. 1 shows a typical communication system 100 having a local/central unit 102 in communication with a remote unit 106 over a communication medium 104. Generally speaking, the communication medium 104 supports bi-directional communications between the local/central unit 102 and the remote unit 106. For convenience, communications from the local/central unit 102 to the remote unit 106 are said to be “downstream” communications, while communications from the remote unit 106 to the local/central unit 102 are said to be “upstream” communications. Thus, the communication medium 104 typically supports a downstream channel over which the local/central unit 102 communicates to the remote unit 106 and an upstream channel over which the remote unit 106 communicates to the local/central unit 102. The upstream channel and the downstream channel may share the same physical communication link or occupy different physical communication links. When the upstream channel and the downstream channel share the same physical communication link, the upstream channel and the downstream channel may occupy the same frequency band (e.g., analog modem channels) or different, typically non-overlapping, frequency bands (e.g., ADSL or cable modem channels). The upstream and downstream channels may be symmetric or asymmetric.

Within the communication system 100, it is common for the upstream and downstream communication channels to have dispersive characteristics. Specifically, each channel has a particular impulse response that disperses signals carried over the channel by extending the effects of each signal over a period of time. In many cases, the dispersive nature of the channel causes various distortions of the signals carried over the channel, such as Inter-Symbol Interference (ISI), Inter-Carrier Interference (ICI), and other distortions.

FIG. 2A shows a representation of an exemplary transmit signal 210 that is transmitted over a dispersive channel, for example, by the local/central unit 102. The exemplary transmit signal 210 includes two symbols, S₁ (211) and S₂ (212), that are transmitted over the dispersive channel with no inter-symbol delay.

FIG. 2B shows a representation of an exemplary receive signal 220 that is received over the dispersive channel, for example, by the remote unit 106, when the symbols S₁ (211) and S₂ (212) are transmitted over the dispersive channel with no inter-symbol delay. As shown in FIG. 2B, the transmitted symbol S₁ (211) is dispersed by the dispersive channel such that the received symbol R₁ (221) overlaps the beginning of the symbol S₂ (212). This causes ISI between the symbols S₁ (211) and S₂ (212) and therefore corruption of the symbol S₂ (212).

One way to avoid or reduce ISI is to add a sufficient amount of inter-symbol delay to the transmitted symbols so that the received symbols do not overlap.

FIG. 3A shows a representation of an exemplary transmit signal 310 that is transmitted over a dispersive channel, for example, by the local/central unit 102. The exemplary transmit signal 310 includes two symbols, S₁ (311) and S₂ (312), that are transmitted over the dispersive channel with inter-symbol delay.

FIG. 3B shows a representation of an exemplary receive signal 320 that is received over the dispersive channel, for example, by the remote unit 106, when the symbols S₁ (311) and S₂ (312) are transmitted over the dispersive channel with inter-symbol delay. As shown in FIG. 3B, the transmitted symbols S₁ (311) and S₂ (312) are dispersed by the dispersive channel. However, because of the inter-symbol delay in the transmitted signal, the received symbols R₁ (321) and R₂ (322) do not overlap. As a result, there is no ISI between the symbols S₁ (311) and S₂ (312).

While the inter-symbol delay added to the transmitted signal eliminates (or at least reduces) ISI, there are detriments to employing such inter-symbol delay. For one, the inter-symbol delay reduces the efficiency of the transmitted signal in that fewer symbols (and therefore less data) are transmitted over a particular period of time. Also, the inter-symbol delay can cause cross-talk between channels carried over a common physical communication link. Thus, inter-symbol delay may be impractical for certain applications.

Another way to avoid or reduce ISI is to “shorten” the impulse response of the channel. This is typically done using a time-domain equalizer (TEQ) at the receiving end of the communication channel. The TEQ is a short Finite Impulse Response (FIR) filter that is used to time-compress (shorten) the impulse response of the communication channel. In addition to shortening the impulse response of the channel, the TEQ also tends to “flatten” the channel and amplify noise. The effectiveness of the TEQ has a direct impact on overall performance, and therefore the TEQ design and the TEQ coefficients must be carefully determined. There are typically different design considerations for time-domain equalization of the upstream and downstream channels. Also, whether the implementation platform is memory or processing power limited (or both) plays an important role in the TEQ design.

The following references are hereby incorporated herein by reference in their entireties, and may be referenced throughout the specification using the corresponding reference number. It should be noted that the reference numbers are not consecutive.

-   [1] John A. C. Bingham, ADSL, VDSL and Multicarrier Modulation, John     Wiley & Sons, 2000. -   [2]J. S. Chow, J. M. Cioffi, and J. A. C. Bingham “Equalizer     Training Algorithms for Multicarrier Modulation Systems,” ICC 1993,     May 1993, pp. 761–765. -   [3] D. D. Falconer and F. R. Magee, Jr., “Adaptive Channel Memory     Truncation for maximum Likelihood Sequence Estimator,” B.S.T.J.     November 1973, pp. 1541–1562. -   [5] D. T. Lee, B. Friedlander, and M. Morf, “Recursive Ladder     Algorithms for ARMA Modeling,” IEEE Trans. Automat. Contr., vol.     AC-27, No. 4, August 1982. -   [6] P. J. W. Melsa, R. C. Younce, C. E. Rohrs, “Impulse Response     Shortening for Discrete Multitone Transceivers,” IEEE Trans.     Commun., vol. 44, No. 12, pp. 1662–1672, December 1996. -   [7] N. Al-Dhahir, A. H. Sayed, and J. M. Cioffi, “Stable pole-zero     modeling of long FIR filters with application to the MMSE-DFE,” IEEE     Trans. Commun., vol. 45, No. 5, pp. 508–513, 1997. -   [8] N. Al-Dhahir, A. H. Sayed, and J. M. Cioffi, “A high-performance     cost-effective pole-zero MMSE-DFE,” Proc. Allerton Conf. Commun.,     Contr., Computing, September 1993, pp. 1166–1175. -   [10] Steven M. Kay, Modern Spectral Estimation: Theory and     Application, Prentice Hall, 1988. -   [11] Peter E. Caines, Linear Stochastic Systems, John Wiley & Sons,     1988. -   [13] N. Al-Dhahir and J. M. Cioffi, “A low complexity pole-zero MMSE     Equalizer for ML Receivers,” Proc. Allerton Conf. Commun., Control,     Comput., pp. 623–632, 1994.

SUMMARY OF THE INVENTION

In accordance with an aspect of the invention, the coefficients for a time-domain equalizer are determined using a two-pass auto-regressive moving average model. A communication channel is first modeled using p₁ poles and q₁ zeros to form a first shortened channel impulse response having a first approximation H₁(z)=B₁(z)/1+A₁(z), wherein q₁ is greater than a predetermined cyclic prefix length. A time-mirrored image of the first shortened channel impulse response is then formed, and the resulting time-mirrored image of the first shortened channel impulse response is modeled using p₂ poles and q₂ zeros to form a second shortened channel impulse response having a second approximation H₂(z)=B₂(z)/1+A₂(z), wherein q₂ is less than or equal to the predetermined cyclic prefix length. The time-domain equalizer coefficients are determined by combining A₁(z) and A₂(1/z) with an appropriate amount of delay.

BRIEF DESCRIPTION OF THE DRAWINGS

The foregoing and other objects and advantages of the invention will be appreciated more fully from the following further description thereof with reference to the accompanying drawings, wherein:

FIG. 1 is a network diagram showing an exemplary communication system having a local/central unit in communication with a remote unit over a communication medium;

FIG. 2A is a representation of an exemplary transmit signal having two symbols that are transmitted over a dispersive channel with no inter-symbol delay;

FIG. 2B is a representation of an exemplary receive signal that is received over a dispersive channel when symbols are transmitted over the dispersive channel with no inter-symbol delay;

FIG. 3A is a representation of an exemplary transmit signal having two symbols that are transmitted over a dispersive channel with inter-symbol delay;

FIG. 3B is a representation of an exemplary receive signal that is received over a dispersive channel when symbols are transmitted over the dispersive channel with inter-symbol delay;

FIG. 4 is a representation of an exemplary transmit signal having two symbols that are transmitted over a dispersive channel with cyclic prefixing;

FIG. 5 is a representation of an exemplary receive signal that is received over a dispersive channel when symbols are transmitted over the dispersive channel with cyclic prefixing;

FIG. 6 shows the impulse responses of simulated downstream and upstream G.lite ADSL channels;

FIG. 7 is a block diagram showing the relevant logic blocks of an exemplary ADSL terminal unit in accordance with an embodiment of the present invention;

FIG. 8 is a block diagram showing the major logic blocks of an exemplary Discrete Multi-Tone (DMT) transceiver in accordance with an embodiment of the present invention;

FIG. 9 is a representation of a time-domain equalized channel;

FIG. 10 shows the shortened channel impulse response delay obtained from a reference model as known in the art;

FIG. 11 shows the shortened channel impulse response delay obtained heuristically for a reference model as known in the art;

FIG. 12 is a logic flow diagram showing exemplary logic for determining TEQ coefficients in accordance with an embodiment of the present invention;

FIG. 13 is a logic flow diagram showing exemplary logic for determining TEQ coefficients in accordance with an embodiment of the present invention;

FIG. 14 is a logic flow diagram showing exemplary logic for determining TEQ coefficients in accordance with an embodiment of the present invention;

FIG. 15 is a logic flow diagram showing exemplary logic for detecting instability in accordance with an embodiment of the present invention;

FIG. 16 shows the compressed (shortened) impulse response obtained by a reference model for simulated downstream and upstream G.lite ADSL channels;

FIG. 17 shows the compressed (shortened) impulse response obtained by an auto-regressive moving average model of the present invention;

FIG. 18 shows a break-out of a channel impulse response into a head, a body, and a tail;

FIG. 19 shows the signal to total distortion ratio obtained from a reference model and from an auto-regressive moving average model of the present invention;

FIG. 20 demonstrates a two-pass shortening model of the present invention;

FIG. 21 shows the signal to total distortion ratio obtained from a reference model and from a two-pass shortening model of the present invention;

FIG. 22 shows the shortened channel impulse response delay obtained from a two-pass shortening model of the present invention; and

FIG. 23 is a logic flow diagram showing exemplary logic for determining TEQ coefficients using a two-pass shortening model of the present invention.

DETAILED DESCRIPTION OF A PREFERRED EMBODIMENT

An embodiment of the present invention applies the multichannel Levinson algorithm for auto-regressive moving average (ARMA) modeling of the channel impulse response. A two-pass shortening model is used for obtaining TEQ coefficients for a spread channel, such as the upstream channel of an Asymmetric Digital Subscriber Line (ADSL) system. In this two-pass shortening model, the communication channel is modeled in two passes, first for the original channel, and then to the time-mirrored image of the result from the first pass. The TEQ parameters are obtained by cascading the filter corresponding to the denominator polynomial of the first pass modeling and the filter corresponding to the mirrored image of the denominator polynomial of the second pass modeling. Delay is introduced so that the resulting shortened channel is closer to the optimal range.

Various TEQ design considerations are discussed herein with reference to an exemplary communication system that is generally known as Asymmetric Digital Subscriber Line (ADSL). However, it should be noted that ADSL is simply an exemplary embodiment, and many of the TEQ design considerations are applicable in other networking environments.

An ADSL communication system provides high-speed digital communications between a telephone company central office and a subscriber site (e.g., a customer home or business) over existing twisted-pair copper telephone wires (i.e., the analog local loop, which is referred to in ADSL parlance as the “subscriber loop”). More specifically, the ADSL communication system provides high-speed digital communications between a central ADSL terminal unit (ATU-C) at the telephone company central office and a remote ADSL terminal unit (ATU-R) at the subscriber site that communicate over the subscriber loop. The communication channel from the ATU-C to the ATU-R is considered to be the downstream channel, while the communication channel from the ATU-R to the ATU-C is considered to be the upstream channel. The upstream and downstream channels usually occupy non-overlapping frequency bands on the subscriber loop, with the upstream channel at a lower frequency band than the downstream channel. The downstream channel provides more communication bandwidth than the upstream channel, and therefore downstream communications are faster than upstream communications.

Discrete Multitone (DMT) has been adopted as the modulation of choice in several ADSL standards. DMT was adopted first by ANSI (T1.413 standard) and then by the ITU-T (G. 992.1 and G. 992.2, formerly G.dmt and G.lite, respectively) for high-speed data transmission over twisted-pair copper wire. In DMT-based transmission, the available signal bandwidth in either the upstream or the downstream direction is divided into multiple parallel orthogonal subchannels. The information bit stream is broken into multiple groups, with each group used to select a quadrature-amplitude-modulation (QAM) constellation point, which in turn modulates a subcarrier in the specific subchannel. One of the key design considerations for a DMT modem is the use of the Inverse Discrete Fourier Transform (IDFT) to modulate the DMT signal and the Discrete Fourier Transform (DFT) to demodulate the DMT signal. The use of DFT/IDFT requires that orthogonality between the subchannels be maintained at the receiver. The DFT and IDFT are effectively performed by using the Fast Fourier Transform (FFT) and Inverse FFT (IFFT), respectively.

Efficient equalization of such subscriber loops can be carried out in the frequency domain by using a single-tap complex equalizer for each subchannel, but only if orthogonality between subchannels is maintained. This means there can be neither inter-carrier interference (ICI) nor inter-symbol interference (ISI) in the DMT signal. Also, in order to use DFT to perform demodulation, the received signal must appear periodic within a block of samples. The approach used in the DMT transceiver to accomplish this periodicity is to prefix a block of samples corresponding to a DMT symbol, with the last v samples of the same block. The name used for these particular v samples is the cyclic prefix (CP).

FIG. 4 shows a representation of an exemplary transmit signal 400 that is transmitted over an ADSL channel, for example, by the ATU-C over the downstream channel. The exemplary transmit signal includes two symbols, S₁ (410) and S₂ (420). Each transmit symbol consists of T=2N+v samples, where N equals the number of DMT subchannels. Each transmit symbol (410, 420) includes cyclic prefix samples (411, 421) and data samples (412, 422), where the cyclic prefix samples (411, 421) are the last v samples of the data samples (412, 422).

While the CP can make the received signal appear periodic, it also reduces the information-carrying capacity of each symbol. Therefore, in order to maintain high transmission efficiency, the CP length should be short relative to the DMT symbol length. For this reason, the value v is fixed in various ADSL recommendations, typically to one-eighth the number of DMT subchannels (i.e., v=N/8). An ADSL implementation typically utilizes 256 DMT subchannels (G.dmt) or 128 DMT subchannels (G.lite) on the downstream channel, and therefore v is equal to 32 or 16 samples, respectively, on the downstream channel. An ADSL implementation typically utilizes 32 DMT subchannels on the upstream channel, and therefore v is equal to 4 samples on the upstream channel.

In order for the CP to make the received signal appear periodic, the amount of dispersion imparted to a transmitted symbol by the communication channel cannot extend past the CP of the following transmitted symbol. In other words, the impulse response of the channel cannot cause the length of the received symbol to be greater than 2N+v samples, where N equals the number of DMT subchannels and v equals the CP length. This is demonstrated by example in FIG. 5, which shows a representation of an exemplary receive signal 500 in which the received symbol R₁ (510), corresponding to the transmitted symbol S₁ (410), does not extend past the CP of the transmitted symbol S₂ (420). This puts a significant constraint on impulse response of the communication channel.

Unfortunately, most subscriber loops are relatively long and have impulse responses that are much longer than the CP length. This can be demonstrated in an exemplary G.lite system that is simulated using ANSI loop #7 with G.lite transceiver filters. FIG. 6( a) shows the impulse response of a simulated downstream G.lite channel. FIG. 6( b) shows the impulse response of a simulated upstream G.lite channel. The impulse response length of each channel (approximately 125 samples for the downstream channel and approximately 40 samples for the upstream channel) is significantly longer than the CP length (16 samples on the downstream channel and 4 samples on the upstream channel). Thus, out of 2N+v received samples, more than v samples will be corrupted due to dispersion, resulting in both ISI and ICI in the received signal.

Solutions to this problem have been sought in both frequency and time domains. The most common approach is to use a TEQ to compress the impulse response of the channel ideally to be equal to or less than the CP length. Then, orthogonality between subcarriers is ensured, and a simple (single complex tap per subcarrier) frequency domain equalizer (FEQ) can be used after the DFT.

In order for the TEQ to effectively compress the impulse response of the channel, the TEQ is “trained” to the impulse response of the channel. Specifically, the TEQ in the ATU-R is trained to the impulse response of the downstream channel, and the TEQ in the ATU-C is trained to the impulse response of the upstream channel. Training involves, among other things, determining a set of TEQ coefficients that provide the required amount of compression. Training is accomplished by transmitting a training signal over the channel and determining the TEQ coefficients based upon the received training signal. Thus, the ATU-R trains its TEQ based upon a training signal transmitted by the ATU-C over the downstream channel, and the ATU-C trains its TEQ based upon a training signal transmitted by the ATU-R over the upstream channel. The training signal typically consists of repeated 2N samples of the pseudo-random number sequence, with no CP added.

FIG. 7 shows the relevant logic blocks of an exemplary ATU 700. Among other things, the ATU 700 includes a data port 702 that is coupled to a DMT transceiver 704, and training logic 706 that is also coupled to the DMT transceiver 704. The data port 702 provides an interface 701 to a computer system, such as a communication network at the telephone company central office (in the case of an ATU-C) or a personal computer at the subscriber site (in the case of an ATU-R), enabling the ATU 700 to send and receive information over the computer system. The DMT transceiver 704 provides an interface 705 to the subscriber loop, enabling the ATU 700 to send and receive information from another ATU over the subscriber loop. The training logic 706 includes various logic for training the components of the DMT transceiver 704, including, among other things, logic for sending a training signal over the subscriber loop and logic for determining TEQ coefficients based upon a received training signal.

FIG. 8 shows the relevant logic blocks of an exemplary DMT transceiver 704 in which only the main functions are included. The DMT transceiver 704 includes a transmitter 810 and a receiver 830 that are coupled to the subscriber loop 705 through a hybrid 820.

At the transmitter 810, transmit data (TxD) from the data port 701 is first processed by the TX data processing block 811, which performs such functions as scrambling, forward error correction (FEC) encoding, and bit loading. The resulting bit stream is then processed by the constellation generation and scaling block 812, which divides the bit stream into small groups, selects a QAM constellation point for each subcarrier (with each constellation point representing a frequency domain sample), scales each frequency domain sample, and extends each block of N frequency domain samples (where N is the number of subcarriers used for transmission) to N₂=2N samples in such a way that the samples are complex-conjugate symmetric around the N-th sample. The samples are then processed by the IDFT block 813, which performs an IDFT and forms a DMT block (symbol) from N₂ output time-domain, real-valued samples. Each DMT symbol is processed by the CP block 814, which adds the CP (i.e., the last v samples of the DMT symbol) to the DMT symbol to form a prefixed DMT symbol. The prefixed DMT symbol is processed by block 815, which converts the DMT symbol into a continuous-time signal for transmission over the subscriber loop 705 via the hybrid 820.

At the receiver 830, the continuous-time signal received over the subscriber loop 705 via the hybrid 820 is first processed by block 831, which filters and samples the received signal. The resulting samples are then processed by the TEQ block 832, which performs time-domain equalization and CP removal so that a DMT time-domain symbol with N₂ samples is obtained. The resulting samples are then processed by the DFT block 833, which performs a DFT on the time-domain symbol in order to transform the time-domain symbol into N₂ frequency samples (where only the first N samples are taken). These frequency samples are processed by the block 834, which performs gain scaling and FEQ on a per-subchannel basis, slices the sample in each subchannel to the nearest constellation point, and decodes the constellation points to recover the transmitted information bits. The resulting bit stream is processed by the RX data processing block 835, which performs such functions as bit unloading, forward error correction (FEC) decoding, and descrambling to produce receive data (RxD) that is forwarded to the data port 702.

As discussed above, the primary function of the TEQ is to “shorten” the transmission channel of a DMT signal so that the combined (shortened) channel has energy confined to the CP window. In order for the TEQ to equalize the channel effectively, the maximum length of the shortened channel impulse response following time-domain equalization is (v+1) samples, where v is the CP length. Additionally, the channel flattening and noise amplification effects of the TEQ must be considered.

The problem of determining the TEQ coefficients for channel impulse response shortening is discussed with reference to FIG. 9. In FIG. 9, x_(k) represents the transmitted signal, h_(k) represents the channel impulse response of the subscriber loop, n_(k) represents various noise components, y_(k) represents the combined signal (channel impulse response and noise) presented to the TEQ, w_(k) represents the impulse response of the TEQ, r_(k) represents the combined (shortened) impulse response of the channel, Δ represents the delay introduced by the transmission channel excluding the delay of the target channel, and b_(k) represent the target impulse response (TIR) of the target channel. It is assumed that the channel impulse response (h_(k)) includes all transmit and receive transceiver filters in addition to the actual local loop channel. The noise term (n_(k)) includes crosstalk, thermal, and quantization noise, as well as radio frequency interference (RFI). TEQ coefficients (w_(k)) are chosen such that the impulse response of the combined channel and TEQ filters (upper branch) matches as close as possible the lower branch. The TIR (b_(k)) has maximum length of (v+1) samples. A known pseudo-random sequence x_(k) with the period equal to the one DMT symbol interval is used for the TEQ training.

The error sequence e_(k) can be expressed as: e _(k) =w _(k) *y _(k) −b _(k) *x _(k−Δ) =w ^(T) y _(k) −b ^(T) x _(k)  (1) where “*” denotes linear convolution and the bold characters represent vectors (as well as matrices) as follows: w=[w₀, w₁, . . . , w_(M−1)]^(T), where M is the number of TEQ coefficients, b=[b₀, b₁, . . . , b_(v)]^(T), where v is the CP length, x_(k)=[x_(k−Δ), x_(k−Δ−I), . . . , x_(k−Δ−v)]^(T), y_(k)=[y_(k), y_(k−1), . . . , y_(k−M+1)]^(T).

The TEQ tap coefficients can be obtained, for example, by finding the values for TEQ (w), TIR (b), and delay (Δ) that minimize the Mean Squared Error (MSE), which is defined as: MSE=E{e _(k) ² }=E{(w ^(T) y _(k) −b ^(T) x _(k))² }=w ^(T) R _(yy) w+b ^(T) R _(xx) b−2w ^(T) R _(yx) ^(Δ) b, where the correlation matrices are defined as: R _(xx) =E{x _(k) x _(k) }, R _(yy) =E{y _(k) y _(k) }, R _(yx) ^(Δ) =E{y _(k) x _(k)}=(R _(xy) ^(Δ))^(T)  (2)

In order to avoid a trivial solution in which the TEQ and TIR taps are equal to zero, some constraints are typically imposed on TEQ or TIR coefficients.

One commonly used constraint requires that the norm of either the TEQ taps or the TIR taps equals unity (i.e., w^(T)w=1 or b^(T)b=1, respectively). In this case, the equalizer taps are obtained by solving the eigenvector problem [3].

Another commonly used constraint is the “unit tap constraint” (UTC), which requires one of the TIR taps to equal unity [13]. This can be written as b^(T)e_(i)=1, where e_(i) is a unit vector having all elements zero except for the i-th element, which equals unity. For convenience, a model that uses such a constraint is referred to hereinafter as a “UTCb” model, and is used as a reference model for performance comparisons against results obtained from various embodiments of the present invention.

UTCb Reference Model

Briefly, the UTCb model obtains the preferred TEQ coefficients by determining TEQ coefficients for each Δε[0, N₂−v−1] and selecting as the preferred TEQ coefficients the TEQ coefficients providing the overall minimum MSE. The UTCb model determines TEQ coefficients for a particular Δ by performing the following steps:

Step 1—Applying second order statistics (2) and the selected value Δ, calculate the following (v+1) by (v+1) matrix: Q _(w) ^(Δ) =R _(xx) −R _(xy) ^(Δ) R _(yy) ⁻¹ R _(yx) ^(Δ)

Step 2—Find minimum diagonal element (i_(opt), i_(opt) position) of: (Q_(w) ^(Δ))⁻¹, i.e. [(Q_(w) ^(Δ))⁻¹]_(i) _(opt) _(, i) _(opt)

Step 3—Calculate TIR coefficients and corresponding MSE:

b opt = ( Q w ) - 1 ⁢ e i opt [ ( Q w ) - 1 ] i opt ⁢ i opt , ⁢ MSE opt = 1 [ ( Q w ) - 1 ] i opt ⁢ i opt ⁢

Step 4—Calculate corresponding TEQ coefficients: w _(opt) =R _(yy) ⁻¹ R _(yx) ^(Δ) b _(opt)

The preferred TEQ coefficients are those that give the overall minimum MSE, i.e., (w_(opt))_(minMSE).

The UTCb model performs well for time-domain equalization of ADSL channels. One problem with the UTCb model, however, is that implementation of the UTCb model is computationally intensive. This is due, in part, because iterations are performed for each Δε[0, N₂−v−1].

Fewer iterations can be performed without loss of performance by limiting the search range to Δε[Δ₁, N₂−v−1], where Δ₁ is the sample order number (k) of h_(k) where the block of v most powerful consecutive samples begins. FIG. 10 shows the value Δ₁ for a simulated downstream G.lite channel.

Furthermore, very little performance is lost if the UTCb model is applied for a single, heuristically found value of Δ=Δ₂=Δ₁+M/2, where M is the TEQ filter length. As shown in FIG. 1, the value Δ₂ provides an acceptably low MSE, although it may not be the optimal value for Δ overall.

Two-Channel Auto-Regressive (TC-AR) Model

In an embodiment of the present invention, the TEQ design is based upon an auto-regressive moving average (ARMA) model, which in turn is converted into a two-channel auto-regressive (TC-AR) model through embedding. The approach of embedding an ARMA model into a TC-AR model is discussed in [5]. Certain advantages of using embedding are discussed in [6].

In the ARMA model, the input and output signals of discrete multitones (DMT) are associated with the parameters of a zero polynomial and a pole polynomial, respectively, thereby resulting in a pole-zero model. The ARMA (pole-zero) model of the channel results in a non-Toeplitz autocorrelation matrix for which there are no computationally efficient solutions.

However, embedding an ARMA model into a TC-AR model produces a model consisting of poles only. With embedding, the input and output signals form a column vector of dimension two, or two channels, and correlation parameters related to the predication vector of the TC-AR model are represented by 2×2 matrices. Thus, the TC-AR model of the channel results in a Toeplitz autocorrelation matrix, the elements of which are 2×2 matrices rather than scalars. Application of this approach to the shortening of long feed-forward and feed-back FIR filters of a channel equalizer is discussed in [7] and [8].

The parameters of the TC-AR model can be solved recursively using the well-known Levinson, Wiggins, Whittle, Robinson (LWWR) algorithm, which is referred to hereinafter as the multichannel Levinson algorithm. A full derivation of the multichannel Levinson algorithm is presented in [9], [10], and [11].

Briefly, the ARMA (pole-zero) model assumes that the transfer function of the unknown system, H(z), has the form:

$\begin{matrix} {{\hat{H}(z)} = {\frac{\hat{Y}(z)}{X(z)} = {{\frac{\sum\limits_{k = 0}^{q}\;{b_{k}z^{- k}}}{1 + {\sum\limits_{k = 1}^{p}\;{a_{k}z^{- k}}}} \approx \frac{Y(z)}{X(z)}} = {H(z)}}}} & (3) \end{matrix}$ where p is the number of poles, q the number of zeros, and X(z) and Y(z) are the input and output of the unknown system, respectively. An equivalent representation in the time domain is: e _(k) =y _(k) −ŷ _(k)  (4) where y_(k) is the output sample and ŷ_(e) is the modeled output sample.

The output sample y_(k) is represented by

$y_{k} = {\sum\limits_{0}^{L}\;{h_{l}{x_{k - l}.}}}$ The ARMA (pole-zero) model is used to model the input-output relationship, which is characterized by the physical channel impulse response {h_(k)}. Starting with a (p,q) pole-zero model, where p is the number of poles and q is the number of zeros of the model, the modeled output sample, ŷ_(k), can be written as: ŷ _(k) =−a ₁ ŷ _(k−1) −a ₂ ŷ _(k−2) − . . . −a _(p) ŷ _(k−p) +b ₀ x _(k) +b ₁ x _(k−1) + . . . +b _(q) x _(k−q)  (5)

For the cases where p≧q, the objective is to estimate the (p+q+1) ARMA parameters {a₁, . . . , a_(p), b₀, . . . , b_(q)} based upon the knowledge of the second-order statistics of the input and output sequences {y_(k), x_(k)}. Denoting these estimates at the jth recursion (1≦j≦p) by {a₁ ^(j), . . . , a_(j) ^(j), b₀ ^(j), . . . , b_(j−δ) ^(j)} where δ=p−q, the output sample y_(k) can be represented as: y _(k) =ŷ _(k) +e _(k) ^(j) =−a ₁ ^(j) ŷ _(k−1) −a ₂ ^(j) ŷ _(k−2) − . . . −a _(j) ^(j) ŷ _(k−j) +b ₀ ^(j) x _(k) +b ₁ ^(j) x _(k−1) + . . . +b _(j−δ) ^(j) x _(k−j−δ) +e _(k) ^(j),  (6) where e_(k) ^(j) is the jth-order residual error sequence and b_(i) ^(j)=0 for i<0. At the jth recursion, an estimate of the ARMA model is determined in which the difference between the number of poles and zeros is also δ, i.e., equal to the desired difference p−q.

Estimating the parameters of the ARMA model described in (5) needs the correlation matrix. In this case, it is a non-Toeptitz matrix for which there exists no computationally efficient solution. Instead, the method of emdedding an ARMA model into a two-channel AR model is used to generate a model consisting of poles only [5], [7], [8]. The correlation matrix then takes a Toeplitz form, and the multichannel Levinson algorithm can be used to solve the model parameters.

Assuming that {x_(k)} is white (which is approximately true for the time-domain DMT signal) and that ŷ_(k−i)=y_(k−i) for 1≦i≦p, (i.e., that the previous estimates are accurate), and defining a vector

${z_{k} = \begin{bmatrix} y_{k} \\ x_{k - \delta} \end{bmatrix}},$ then the ARMA model of (6) can be converted to the two-channel-AR model as:

${z_{k} = {{{{- \Theta_{1}^{j}}z_{k - 1}} - \cdots - {\Theta_{\delta - {1j}}z_{k - \delta - 1}} - {\Theta_{\delta}^{j}z_{k - \delta}} - \cdots - {\Theta_{j}^{j}z_{k - j}} + \begin{bmatrix} e_{k}^{j} \\ x_{k + \delta} \end{bmatrix}} = {{\hat{z}}_{k} + e_{k}}}},$ in which the forward prediction coefficient matrices of the linear predictor {circumflex over (z)}_(k) are defined as:

$\begin{matrix} {{- \Theta_{i}^{j}} = \left\{ \begin{matrix} {\begin{bmatrix} {- a_{i}^{j}} & 0 \\ 0 & 0 \end{bmatrix},\mspace{14mu}{1 \leq i \leq {\delta - 1}}} \\ {\begin{bmatrix} {- a_{i}^{j}} & b_{i - \delta}^{j} \\ 0 & 0 \end{bmatrix},{\delta \leq i \leq {j.}}} \end{matrix} \right.} & (7) \end{matrix}$

Applying the orthogonality principle, which deals with the real signal only and states that E[(z_(k)−{circumflex over (z)}_(k))z_(k) ^(T)]=0 (where i=1, 2, . . . j and T denotes transpose), results in:

$\begin{matrix} {{E\left\lbrack {\left( {z_{k} + {\sum\limits_{m = 1}^{j}\;{\Theta_{m}^{j}z_{k - m}}}} \right)z_{k - i}^{T}} \right\rbrack} = 0} & (8) \end{matrix}$ or equivalently: R(i)+Θ₁ ^(j) R(i−1)+ . . . +Θ_(j) ^(j) R(i−j)0, 1≦i≦j  (9) in which the autocorrelation matrix is defined as:

$\begin{matrix} {{R(i)} = {{E\left\lbrack {z_{k}z_{k - i}^{T}} \right\rbrack} = {\begin{bmatrix} {R_{yy}(i)} & {R_{yx}\left( {i - \delta} \right)} \\ {R_{xy}\left( {i - \delta} \right)} & {R_{xx}(i)} \end{bmatrix} = {{R^{T}\left( {- i} \right)}.}}}} & (10) \end{matrix}$

Combining (8), (9), and (10) for the case i=0 results in:

$\begin{matrix} {{E\left\lbrack {\left( {z_{k} - {\hat{z}}_{k}} \right)z_{k}^{T}} \right\rbrack} = {{E\left\lbrack {\left( {z_{k} + {\sum\limits_{m = 1}^{j}\;{\Theta_{m}^{j}z_{k - m}}}} \right)z_{k}^{T}} \right\rbrack} = {{{R(0)} + {\sum\limits_{m = 1}^{j}\;{\Theta_{m}^{j}{R\left( {- m} \right)}}}} = {{E\left\lbrack {e_{k}e_{k}^{T}} \right\rbrack} = {\sum\limits_{j}^{f}\;.}}}}} & \left( {11a} \right) \end{matrix}$

That is, R(0)+Θ₁ ^(j) R(−1)+ . . . +Θ_(j) ^(j) R(−j)=Σ_(j) ^(f).  (11b)

Combining (9) and (11b) results in the jth-order two-channel AR model in matrix form, as follows:

$\begin{matrix} {{\left\lbrack {\Theta_{i}^{j}\ldots\mspace{14mu}\Theta_{1}^{j}I} \right\rbrack\begin{bmatrix} {R(0)} & \ldots & {R\left( {- \left( {j - 1} \right)} \right)} & {R\left( {- j} \right)} \\ \vdots & ⋰ & \vdots & \vdots \\ {R\left( {j - 1} \right)} & \ldots & {R(0)} & {R\left( {- 1} \right)} \\ {R(j)} & \ldots & {R(1)} & {R(0)} \end{bmatrix}} = {\left\lbrack {0\mspace{14mu}\ldots\mspace{14mu} 0\sum\limits_{j}^{f}}\; \right\rbrack.}} & (12) \end{matrix}$

Similarly, the equation for the backward prediction vector can be derived as:

$\begin{matrix} {{\left\lbrack {I\;\Theta_{1}^{j}\mspace{14mu}\cdots\mspace{14mu}\Theta_{j}^{j}} \right\rbrack\left( \begin{matrix} {R(0)} & \cdots & {R\left( {- \left( {j - 1} \right)} \right)} & {R\left( {- j} \right)} \\ \vdots & ⋰ & \vdots & \vdots \\ {R\left( {j - 1} \right)} & \cdots & {R(0)} & {R\left( {- 1} \right)} \\ {R(j)} & \cdots & {R(1)} & {R(0)} \end{matrix} \right)} = {\left\lbrack {\sum\limits_{j}^{b}\;{0\mspace{14mu}\cdots\mspace{14mu} 0}} \right\rbrack.}} & (13) \end{matrix}$

Combining (12) and (13) results in:

$\begin{matrix} {{\left\lbrack {\begin{matrix} \Theta_{j}^{j} \\ {{I\;\Phi_{1}^{j}}\mspace{14mu}} \end{matrix}\begin{matrix} \ldots & {\Theta_{1}^{j}I} \\ \ldots & \Phi_{j}^{j} \end{matrix}} \right\rbrack\left\lbrack \begin{matrix} {R(0)} & \cdots & {R\left( {- \left( {j - 1} \right)} \right)} & {R\left( {- j} \right)} \\ \vdots & ⋰ & \vdots & \vdots \\ {R\left( {j - 1} \right)} & \ldots & {R(0)} & {R\left( {- 1} \right)} \\ {R(j)} & \ldots & {R(1)} & {R(0)} \end{matrix} \right\rbrack} = {\quad{\left\lbrack \begin{matrix} 0 & \ldots & 0 & {\sum\limits_{j}^{f}\;} \\ {\sum\limits_{j}^{b}\;} & 0 & \ldots & 0 \end{matrix} \right\rbrack.}}} & (14) \end{matrix}$

In the next iteration (i.e., j→j+1), (14) becomes:

$\begin{matrix} {{\left\lbrack \begin{matrix} \Theta_{j + 1}^{j + 1} & \ldots & \Theta_{1}^{j + 1} & I \\ I & \Phi_{1}^{j + 1} & \ldots & \Phi_{j + 1}^{j + 1} \end{matrix} \right\rbrack\left\lbrack \begin{matrix} {R(0)} & \ldots & {R\left( {- j} \right)} & {R\left( {- \left( {j + 1} \right)} \right)} \\ \vdots & ⋰ & \vdots & \vdots \\ {R(j)} & \ldots & {R(0)} & {R\left( {- 1} \right)} \\ {R\left( {j + 1} \right)} & \ldots & {R(1)} & {R(0)} \end{matrix} \right\rbrack} = {\quad{\left\lbrack \begin{matrix} 0 & \ldots & 0 & {\sum\limits_{j + 1}^{f}\;} \\ {\sum\limits_{j = 1}^{b}\;} & 0 & \ldots & 0 \end{matrix} \right\rbrack.}}} & (15) \end{matrix}$

The following are now defined:

$\begin{matrix} {{\begin{bmatrix} 0 & \ldots & 0 & {\sum\limits_{j + 1}^{f}\;} \\ {\sum\limits_{j + 1}^{b}\;} & 0 & \ldots & 0 \end{bmatrix} = {\begin{bmatrix} I & K_{j + 1}^{f} \\ K_{j + 1}^{b} & I \end{bmatrix}\begin{bmatrix} \alpha & \ldots & 0 & {\sum\limits_{j}^{f}\;} \\ {\sum\limits_{j}^{b}\;} & 0 & \ldots & \beta \end{bmatrix}}},} & (16) \end{matrix}$ and

$C^{j + 1} = {\begin{bmatrix} {R(0)} & \ldots & {R\left( {- j} \right)} & {R\left( {- \left( {j + 1} \right)} \right)} \\ \vdots & ⋰ & \vdots & \vdots \\ {R(j)} & \ldots & {R(0)} & {R\left( {- 1} \right)} \\ {R\left( {j + 1} \right)} & \ldots & {R(1)} & {R(0)} \end{bmatrix}.}$

Combining (15) and (16) results in:

$\begin{matrix} {{\begin{bmatrix} \Theta_{j + 1}^{j + 1} & \ldots & \Theta_{1}^{j + 1} & I \\ I & \Phi_{1}^{j + 1} & \ldots & \Phi_{j + 1}^{j + 1} \end{bmatrix} C^{j + 1}} = {\quad{\left\lbrack \begin{matrix} {\alpha +} & K_{j + 1}^{f} & \sum\limits_{j}^{b} & 0 & \ldots & {\sum\limits_{j}^{f}\;} & K_{j + 1}^{f} & \beta \\ K_{j + 1}^{b} & {\alpha +} & \sum\limits_{j}^{b} & 0 & \ldots & K_{j + 1}^{b} & {\sum\limits_{j}^{f}\; +} & \beta \end{matrix} \right\rbrack.}}} & (17) \end{matrix}$

Rewriting the right side of (17) and also using (15) results in (18), as follows:

$\begin{bmatrix} {{K_{j + 1}^{f}\left\lbrack {\sum\limits_{j}^{b}\mspace{14mu}{0\mspace{14mu}\ldots\mspace{14mu} 0\mspace{14mu}\beta}} \right\rbrack} + \left\lbrack {\alpha\mspace{14mu} 0\mspace{14mu}\ldots\mspace{14mu} 0\mspace{14mu}\sum\limits_{j}^{f}} \right\rbrack} \\ {{K_{j + 1}^{b}\left\lbrack {\alpha\mspace{14mu} 0\mspace{14mu}\ldots\mspace{14mu} 0\mspace{14mu}\sum\limits_{j}^{f}} \right\rbrack} + \left\lbrack {\sum\limits_{j}^{f}\mspace{14mu}{0\mspace{14mu}\ldots\mspace{14mu} 0\mspace{14mu}\beta}} \right\rbrack} \end{bmatrix} = {\quad{\left\lbrack \begin{matrix} {{K_{j + 1}^{f}\left\lbrack {I\mspace{14mu}\Phi_{1}^{j}\mspace{14mu}\ldots\mspace{14mu}\Phi_{j}^{j}\mspace{14mu} 0} \right\rbrack} + \left\lbrack {0\mspace{14mu}\Theta_{j}^{j}\mspace{14mu}\ldots\mspace{14mu}\Theta_{1}^{j}\mspace{14mu} I} \right\rbrack} \\ {{K_{j + 1}^{b}\left\lbrack {0\mspace{14mu}\Theta_{j}^{j}\mspace{14mu}\ldots\mspace{14mu}\Theta_{1}^{j}\mspace{14mu} I} \right\rbrack} + \left\lbrack {I\mspace{14mu}\Phi_{1}^{j}\mspace{14mu}\ldots\mspace{14mu}\Phi_{j}^{j}\mspace{14mu} 0} \right\rbrack} \end{matrix} \right\rbrack C^{j + 1}}}$

From (18), it can be seen that:

$\begin{matrix} {\begin{bmatrix} {\sum\limits_{j}^{b}\;} & 0 & \ldots & 0 & \beta \\ \alpha & 0 & \ldots & 0 & {\sum\limits_{j}^{f}\;} \end{bmatrix} = {\begin{bmatrix} I & \Phi_{1}^{j} & \ldots & \Phi_{j}^{j} & 0 \\ 0 & \Theta_{j}^{j} & \ldots & \Theta_{1}^{j} & I \end{bmatrix}{C^{j + 1}.}}} & (19) \end{matrix}$ where: α=Θ_(j) ^(j) R(1)+ . . . +Θ₁ ^(j) R(j)+R(j+1),  (20a) β=Θ_(j) ^(j) R(−1)+ . . . +Θ₁ ^(j) R(−j)+R(−(j+1)).  (20b)

It can also be proven that: β=α^(T).  (21)

From (16), it can be seen that α+K_(j+1) ^(f)Σ_(j) ^(b)=0 and β+K_(j+1) ^(b)Σ_(j) ^(f)=0, and therefore: K _(j+1) ^(f)=−α(Σ_(j) ^(b))⁻¹,  (22a) K _(j+1) ^(b)=−β(Σ_(j) ^(f))⁻¹.  (22b)

Comparing the left-side of (17) with the right-side of (18), it can be seen that: Θ_(i) ^(j+1)=Θ_(i) ^(j) +K _(j+1) ^(f)φ_(j−i+1) ^(j) 1≦i≦j  (23a) φ_(i) ^(j+1)=φ_(i) ^(j) +K _(j+1) ^(b)Θ_(j−i+1) ^(j) 1≦i≦j  (23b)

From (16), it can be seen that: Σ_(j+1) ^(f)=Σ_(j) ^(f) +K _(j+1) ^(f)β  (24a) Σ_(j+1) ^(b)=Σ_(j) ^(b) +K _(j+1) ^(b)α  (24b)

Noting that α=−K_(j+1) ^(f)Σ_(j) ^(b) and β=−K_(j+1) ^(b)Σ_(j) ^(f), (24a) and (24b) can be rewritten as: Σ_(j+1) ^(f)=(I−K _(j+1) ^(f) K _(j+1) ^(b))Σ_(j) ^(f),  (25a) Σ_(j+1) ^(b)=(I−K _(j+1) ^(b) K _(j+1) ^(f))Σ_(j) ^(b).  (25b)

Similarly, by comparing the left-side of (17) with the right-side of (18), it can be seen that: Θ_(j+1) ^(j+1) =K _(j+1) ^(f),  (26a) φ_(j+1) ^(j+1) =K _(j+1) ^(b).  (26b)

Substituting (21) into (22b) and (26) into (22), (23), and (25) results in the equations of recursion (27), as follows:

${Recursions}:{1\underset{\_}{<}j\underset{\_}{<}{p - {1\begin{matrix} {\alpha = {{\Theta_{j}^{j}{R(1)}} + \ldots + {\Theta_{1}^{j}{R(j)}} + {R\left( {j + 1} \right)}}} & \left( {27a} \right) \\ {\Theta_{j + 1}^{j + 1} = {- {\alpha\left( \sum\limits_{j}^{b} \right)}^{- 1}}} & \left( {27b} \right) \\ {\Phi_{j + 1}^{j + 1} = {- {\alpha^{T}\left( \sum\limits_{j}^{f} \right)}^{- 1}}} & \left( {27c} \right) \\ {\Theta_{i}^{j + 1} = {{\Theta_{i}^{j} + {\Theta_{j + 1}^{j + 1}\Phi_{j - i + 1}^{j}\mspace{14mu} 1}}\underset{\_}{<}i\underset{\_}{<}j}} & \left( {27d} \right) \\ {\Phi_{i}^{j + 1} = {{\Phi_{i}^{j} + {\Phi_{j + 1}^{j + 1}\Theta_{j - i + 1}^{j}\mspace{14mu} 1}}\underset{\_}{<}i\underset{\_}{<}j}} & \left( {27e} \right) \\ {{\sum\limits_{j + 1}^{f}\;{= {\left( {I - {\Theta_{j + 1}^{j + 1}\Phi_{j + 1}^{j + 1}}} \right)\sum\limits_{j}^{f}}}}\;} & \left( {27f} \right) \\ {{\sum\limits_{j + 1}^{b}\;{= {\left( {I - {\Phi_{j + 1}^{j + 1}\Theta_{j + 1}^{j + 1}}} \right){\sum\limits_{j}^{b}.}}}}\;} & \left( {27g} \right) \end{matrix}}}}$

The initial conditions for the recursion (28) can be derived from the zeroth-order forward and backward predictors, as follows:

${Initial}\mspace{14mu}{{Conditions}:\begin{matrix} {\Theta_{1}^{1} = {{- {R(1)}}\left( {R(0)} \right)^{- 1}}} & \left( {28a} \right) \\ {\Phi_{1}^{1} = {{- {R\left( {- 1} \right)}}\left( {R(0)} \right)^{- 1}}} & \left( {28b} \right) \\ {\sum\limits_{1}^{f}\;{= {\left( {I - {\Theta_{1}^{1}\Phi_{1}^{1}}} \right){R(0)}}}} & \left( {28c} \right) \\ {\sum\limits_{1}^{b}\;{= {\left( {I - {\Phi_{1}^{1}\Theta_{1}^{1}}} \right){R(0)}}}} & \left( {28d} \right) \end{matrix}}$

Upon the completion of recursions, the TEQ coefficients are obtained directly from the relationship shown in (7), that is, the first entry of matrix Θ_(i) ^(p), 1≦i≦p,

$\begin{matrix} \begin{matrix} {{w_{0} = {a_{0}^{p} = 1}},} & \; \\ {w_{i} = {a_{i}^{p} = {\Theta_{i}^{p}\left( {1,1} \right)}}} & {1\underset{\_}{<}i\underset{\_}{<}p} \end{matrix} & (29) \end{matrix}$

For the case where p=q, that is, δ=0, the TC-AR model takes a slightly different form. Defining

${z_{k}\begin{bmatrix} y_{k} \\ x_{k} \end{bmatrix}},$ the input-output relationship is now:

$\begin{matrix} {z_{k} = {{{- \Theta_{1}^{j}}z_{k - 1}} - \ldots - {\Theta_{j}^{j}z_{k - j}} + {\begin{bmatrix} 1 & b_{0}^{j} \\ 0 & 1 \end{bmatrix}\begin{bmatrix} {{\mathbb{e}}^{\prime}}_{k}^{j} \\ x_{k} \end{bmatrix}}}} & (30) \end{matrix}$

${{{where}\mspace{14mu} - \Theta_{i}^{j}} = \begin{bmatrix} {- a_{i}^{j}} & b_{i}^{j} \\ 0 & 0 \end{bmatrix}},{1\underset{\_}{<}i\underset{\_}{<}{j.}}$

The same recursions of (27) with the initial conditions of (28) can be used to estimate the model parameters. At the end of recursion, we have {a₁ ^(p), . . . , a_(p) ^(p), b₁ ^(p), . . . , b_(p) ^(p)} estimated with b₀ ^(p) determined as b₀ ^(p)=Σ_(p) ^(f)(1, 2)/R_(xx)(0). The TEQ coefficients are obtained in the same manner as in (29).

For the case where p<q, the roles of the input and output need to be interchanged in order to use the recursions of (27) to estimate the model parameters. Specifically, the two-channel vector

$z_{k} = \begin{bmatrix} x_{k} \\ y_{k + \delta} \end{bmatrix}$ where in this case δ=q−p. The TC-AR model is then:

$\begin{matrix} {z_{k} = {{{{- \Theta_{1}^{j}}z_{k - 1}} - \ldots - {\Theta_{\delta - 1}^{j}z_{k - \delta - 1}} - {\Theta_{\delta}^{j}z_{k - \delta}} - \ldots - {\Theta_{j}^{j}z_{k - j}} + \begin{bmatrix} {{\mathbb{e}}^{''}}_{k}^{j} \\ y_{k + \delta} \end{bmatrix}} = {{\hat{z}}_{k} + e_{k}^{''}}}} & (31) \end{matrix}$ where:

$\begin{matrix} {{- \Theta_{i}^{j}} = \left\{ \begin{matrix} {\begin{bmatrix} {- b_{i}^{j}} & 0 \\ 0 & 0 \end{bmatrix},} & {1\underset{\_}{<}i\underset{\_}{<}{\delta - 1}} \\ {\begin{bmatrix} {- b_{i}^{j}} & a_{i - \delta}^{j} \\ 0 & 0 \end{bmatrix},} & {\delta\underset{\_}{<}i\underset{\_}{<}{j.}} \end{matrix} \right.} & (32) \end{matrix}$

Since z_(k) is defined differently, the correlation matrix now takes the form of:

$\begin{matrix} {{R(i)} = {{E\left\lbrack {z_{k}z_{k - i}^{T}} \right\rbrack} = {\begin{bmatrix} {R_{xx}(i)} & {R_{xy}\left( {i - \delta} \right)} \\ {R_{yx}\left( {i + \delta} \right)} & {R_{yy}(i)} \end{bmatrix} = {{R^{T}\left( {- i} \right)}.}}}} & (33) \end{matrix}$

The recursions of (27) with the same initial conditions of (28) are directly applicable to this case for estimating {a₁ ^(q), . . . , a_(p) ^(q), b₁ ^(q), . . . , b_(q) ^(q)} with b₀ ^(q)=0. The TEQ coefficients are then obtained as: w _(i) =a _(i) ^(q)=Θ_(i+δ) ^(q)(1, 2) 0≦i≦p  (34)

Estimating the TC-AR model parameters requires the computation of correlation matrices defined either in (10) for p≧q or (33) for p<q. These correlation quantities can be computed directly from the received time-domain DMT signal and the known input signal during training. In order to suppress noise, correlation sequences may be computed repeatedly over many DMT symbols and then averaged. Alternatively, the impulse response of the physical channel can be estimated from the received DMT signal and the known input signal via FFT/IFFT, and the impulse response can be directly assigned to the correlation sequences.

As described before, the time-domain DMT input sequence is approximately white. Therefore,

${{{R_{xx}(l)} = {S_{x}\delta_{l}}};{{R_{yy}(l)} = {S_{x}{\sum\limits_{m = 0}^{L}\;{h_{m}h_{m - l}}}}};{{R_{yx}(l)} = {{S_{x}h_{l}} = {R_{xy}\left( {- l} \right)}}}},$ where S_(x) is the average energy of the input signal.

ARMA model order selection is an important component in generating an accurate approximation of the channel. Subscriber loops in typical carrier service areas have different lengths and configurations (such as the existence or non-existence of bridged-taps). In order to more precisely model the impulse responses of subscriber loops, models with different pole-zero orders may be used. However, the subscriber loop characteristics are not known to the ATU before a connection is established, so the model order for a particular subscriber loop is not known a priori. Therefore, the model order must be determined dynamically.

In an ARMA model, the model order is determined by the number of zeros and the number of poles. The number of zeros (q) is determined according to the CP length, such that q≦(v+1). The number of poles (p) is constrained by the complexity of the system. If either is too large, then TEQ performance degrades. Therefore, they should not be made arbitrarily long.

Various tests have been proposed in the literature for determining p and q from the output sequence y_(k) or from its autocorrelation function R_(yy) [10]. One approach, discussed in [7] and [8], utilizes a “brute force” approach. In this approach, maximum allowable complexity is assumed, which in turn sets the upper bound for values of p and q. Next, an exhaustive search is performed over different choices of p and q, within the allowable range, until an acceptable approximation is found based upon some predetermined statistic, which is typically the mean squared error (MSE), although other statistics such as signal to total distortion ratio (STDR) or shortened signal to noise ratio (SSNR) may alternatively be used. However, it is important to emphasize that increasing p and/or q could result in a worse approximation, depending on the channel impulse response h_(k). This “brute force” approach is too time consuming and complex for certain implementations.

An embodiment of the present invention reduces the number of searches needed to obtain the TEQ coefficients by only performing searches for (p−q)=x. Typically, no search is performed for p=0. This is possible because the parameters of a low-order TC-AR model can be obtained from those of a high-order model if the order differential between the pole and zero polynomials remains the same.

Consider the forward prediction coefficient matrix defined in (7) for the fixed value of δ, that is, the order differential between the pole polynomial and the zero polynomial. In each new recursion, one more prediction coefficient matrix is obtained while the matrices obtained in previous recursions are updated. Therefore, as in the single-channel AR modeling case, the prediction coefficients of a low-order model can be obtained from a high-order model. To provide reasonable performance, the lowest order may be limited to the one at which the predetermined statistic (e.g., MSE) is below a predefined level.

In such an embodiment, searching starts from a high-order model with a fixed δ, for example, the model with 32 poles for the case p≧q or 32 zeros for the case p<q. Then correlation matrices are computed according to the high-order model and the multichannel Levinson algorithm is applied in order to determine TEQ coefficients for the particular value of δ. At each recursion j, {a₁ ^(j+1), . . . , a_(j+1) ^(j+1)} is stored, with a₀ ^(j+1)=1 for p≧q or {a₀ ^(j+1), . . . , a_(j+1−δ) ^(j+1)}for p<q.

Multiple sets of TEQ coefficients are obtained after completing all recursions. For each set of TEQ coefficients, the received time-domain DMT sample sequence is equalized by a TEQ with the number of taps equal to the number of coefficients in that set of TEQ coefficients. The output sequence is shifted back by a fixed amount and then transformed to frequency domain via FFT. The input signal information is removed from the frequency samples, and then the IFFT is performed. The resulting time-domain sequence with a length of 2N samples yields the impulse response of the shortened channel. A sliding window with a length of (v+1) is then used to find the (v+1) most powerful consecutive samples. A shortened signal to noise ratio (SSNR) is then computed. The SSNR is defined as the ratio of impulse response coefficient energy within a predetermined window to that outside the window, as follows:

${SSNR} = \frac{\sum\limits_{i = a}^{a + v}\; r_{i}^{2}}{\begin{matrix} {{\sum\limits_{i = 0}^{a - 1}\; r_{i}^{2}} +} & {\underset{i = {a + v + 1}}{\overset{{2N} - 1}{\sum r_{i}^{2}}}\;} \end{matrix}}$ where a is the number of the first sample of the (v+1) most powerful consecutive samples.

After a SSNR is obtained for each set of TEQ coefficients, the set of TEQ coefficients providing the highest SSNR is selected as the final TEQ coefficients.

Thus, an embodiment of the present invention computes TEQ coefficients for each (p−q)=x(p≠0), finds the combined (shortened) channel for each such (p−q)=x, computes the SSNR for each such (p−q)=x, and selects the TEQ coefficients providing the best SSNR.

FIG. 12 is a logic flow diagram showing exemplary logic 1200 for determining TEQ coefficients using, the multichannel Levinson algorithm for ARMA modeling, wherein the coefficients for a low-order model are obtained from a high-order model. Beginning at step 1202, the logic computes TEQ coefficients for a new (previously untested) value of (p−q)=x, in step 1204. The logic preferably does not test the case for p=0. The logic then finds the combined (shortened) channel resulting from the TEQ coefficients, in step 1206. The logic then computes the SSNR for the shortened (combined) channel, in step 1208. If the logic has completed all (p−q) combinations to be tested (YES in step 1210), then the logic selects the TEQ coefficients providing the best SSNR, in step 1212. If the logic has not completed all (p−q) combinations to be tested (NO in step 1210), then the logic recycles to step 1204 to test (p−q) combination. The logic 1200 terminates in step 1299.

When used specifically for determining TEQ coefficients for ADSL channels (as opposed to other types of communication channels), the number of searches can be reduced even further by searching only (p−q)=x for values of x>0 (i.e., p>q). This is possible because simulation results have shown that, at least for ADSL, the TEQ coefficients for values of q≧p are worse than the TEQ coefficients for values of p>q.

FIG. 13 is a logic flow diagram showing exemplary logic 1300 for determining TEQ coefficients using the multichannel Levinson algorithm for ARMA modeling, wherein the coefficients for a low-order model are obtained from a high-order model and wherein only combinations of (p−q) for p>q are tested. Beginning at step 1302, the logic computes TEQ coefficients for a new, previously untested, value of (p−q)=x, where p>q, in step 1304. The logic preferably does not test the case for p=0. The logic then finds the combined (shortened) channel resulting from the TEQ coefficients, in step 1306. The logic then computes the SSNR for the shortened (combined) channel, in step 1308. If the logic has completed all (p−q) combinations to be tested (YES in step 1310), then the logic selects the TEQ coefficients providing the best SSNR, in step 1312. If the logic has not completed all (p−q) combinations to be tested (NO in step 1310), then the logic recycles to step 1304 to test (p−q) combination. The logic 1300 terminates in step 1399.

The number of searches can be further reduced by searching only for p≧p_(min), where p_(min) is a predetermined minimum value for p (e.g., p≧8). This is possible because, practically speaking, the model order will generally have at least the predetermined minimum number of poles, and therefore there is no need to search model orders having fewer poles.

FIG. 14 is a logic flow diagram showing exemplary logic 1400 for determining TEQ coefficients using the multichannel Levinson algorithm for ARMA modeling, wherein the coefficients for a low-order model are obtained from a high-order model and wherein only combinations of (p−q) for p>q and p≧p_(min) are tested. Beginning at step 1402, the logic computes TEQ coefficients for a new, previously untested, value of (p−q)=x, where p>q and p≧p_(min), in step 1404. The logic preferably does not test the case for p=0. The logic then finds the combined (shortened) channel resulting from the TEQ coefficients, in step 1406. The logic then computes the SSNR for the shortened (combined) channel, in step 1408. If the logic has completed all (p−q) combinations to be tested (YES in step 1410), then the logic selects the TEQ coefficients providing the best SSNR, in step 1412. If the logic has not completed all (p−q) combinations to be tested (NO in step 1410), then the logic recycles to step 1404 to test (p−q) combination. The logic 1400 terminates in step 1499.

In theory, the TC-AR model is guaranteed to be stable, since the multichannel Levinson algorithm is used to estimate the model parameters [8]. In a practical implementation, however, the TC-AR model, and, equivalently, the pole-zero ARMA model, can be unstable because of numerical errors.

It can be observed from the recursions of (27) that the computations of α, Θ_(j+1) ^(j+1) and φ_(j+1) ^(j+1), are error-prone. This is because, in computing α, successive matrix multiplications and accumulations are carried out. Small errors in the parameters obtained from the previous recursion manifest themselves into significant errors in the new recursion. The computations of Θ_(j+1) ^(j+1) and φ_(j+1) ^(j+1), on the other hand, involve matrix inversion. As the diagonal entries of Σ_(j) ^(f) and Σ_(j) ^(b) become small in magnitude, inversions of these matrices yield large-value entries. Again, any small error in the original matrix results in a large error in the inverted one.

Simulations have confirmed that, with very high signal-to-noise ratio (SNR), an unstable model can be obtained with a floating-point implementation. Thus, it is desirable to detect such instabilities during recursions.

Consider the case p≧q. From (11a), it can be seen that:

$\begin{matrix} {\sum\limits_{j}^{f}\;{= {{E\left\lbrack {{\mathbb{e}}_{k}{\mathbb{e}}_{k}^{T}} \right\rbrack} = {\begin{bmatrix} {E\left\{ {\mathbb{e}}_{k}^{2j} \right\}} & {E\left\{ {{\mathbb{e}}_{k}^{j}x_{k + \delta}} \right\}} \\ {E\left\{ {x_{k + \delta}{\mathbb{e}}_{k}^{j}} \right\}} & {E\left\{ x_{k + \delta}^{2} \right\}} \end{bmatrix}.}}}} & (35) \end{matrix}$

The upper diagonal entry of Σ_(j) ^(f) represents the mean square error of forward prediction. As the recursion value j increases, the mean square error decreases monotonically for a stable model.

Also, (27f) can be rewritten as: Σ_(j+1) ^(f)=(I−Θ _(j+1) ^(j+1)φ_(j+1) ^(j+1))Σ_(j) ^(f).  (36)

From (7), it can be observed that Θ_(j+1) ^(j+1) has non-zero entries on the first row. It can also be shown that φ_(j+1) ^(j+1) has non-zero entries on the first column. The product of these two matrices has a non-zero entry only at the upper diagonal location, which is the only non-zero eigenvalue of the product matrix. For convenience, this eigenvalue is denoted as λ₁. Thus, the ARMA model is stable (i.e., the pole polynomial has all its roots inside the unit circle in the z-plane or in other words, it has a minimum-phase response) if: 0<λ₁<1.  (37)

Equivalently, the matrix (I−Θ_(j+1) ^(j+1)φ_(j+1) ^(j+1)) shall be positive definite to ensure the stability of the model. Also, its eigenvalues shall not be larger than 1. Once this condition is met, it can be seen from (36) that the mean square error decreases monotonically.

Thus, in an exemplary embodiment of the invention, the training logic uses (37) to check whether or not the model is stable. Specifically, the training logic monitors the value of λ₁ during each recursion to verify that the 0<λ₁<1 condition is met. If the model becomes unstable during a particular recursion (i.e., the 0<λ₁<1 condition is not met), that recursion is terminated, since any values that would be obtained thereafter would be unreliable.

FIG. 15 is a logic flow diagram showing exemplary logic 1500 for detecting instability during an iteration of the multichannel Levinson algorithm for ARMA modeling. Beginning at step 1502, the logic performs a recursion of the multichannel Levinson algorithm for ARMA modeling, in step 1504. The logic then examines the value of λ₁, in step 1506. If 0<λ₁<1 (YES in step 1508), then the logic recycles to step 1504 for a next recursion. Otherwise (NO in step 1508), then logic terminates the iteration. The logic 1500 terminates in step 1599.

An alternative is to add a small constant to the upper diagonal entry of Σ_(j) ^(f) whenever its value reaches a low threshold. In this way, the stability of the model can be ensured for the current recursion. Similar correction can be done to Σ_(j) ^(b). Although this correction is useful for obtaining a higher-order stable model, instability of the model with a much higher-order cannot be completely avoided. Furthermore, with the correction, the mean square error may not decrease monotonically around the correction point.

The multichannel Levinson algorithm for ARMA modeling can be expressed in a slightly different form. In this summary of the multichannel Levinson algorithm, it is assumed that the number of poles (p) is greater than the number of zeros (q), i.e. δ>0. The algorithm takes a slightly different form for cases p=q and p<q, as will be apparent to a skilled artisan. However, since the best results are expected when p>q for time-domain equalization of ADSL channels, the algorithm is not described for the p≦q cases for convenience.

The error sequence e_(k) from (1) can be rewritten as:

$\begin{matrix} {{\mathbb{e}}_{k} = {{y_{k} - {\hat{y}}_{k}} = {{- {\sum\limits_{i = 0}^{q}\;{b_{i}x_{k - i}}}} + {\sum\limits_{i = 0}^{p}\;{a_{i}y_{k - i}}}}}} & (38) \end{matrix}$

Introducing the following definitions:

$\begin{matrix} {{u_{k} = \begin{bmatrix} y_{k} \\ x_{k - \delta} \end{bmatrix}},\mspace{14mu}{{\mathbb{e}}_{k}\begin{bmatrix} {\mathbb{e}}_{k} \\ x_{k - \delta} \end{bmatrix}},{A_{i} = \left\{ \begin{matrix} \begin{bmatrix} a_{i} & 0 \\ 0 & 0 \end{bmatrix} & {1\underset{\_}{<}i < \delta} \\ \begin{bmatrix} a_{i} & {- b_{i - \delta}} \\ 0 & 0 \end{bmatrix} & {\delta\underset{\_}{<}i\underset{\_}{<}p} \end{matrix} \right.}} & (39) \end{matrix}$ where δ=p−q, (38) can be rewritten as:

$\begin{matrix} {{{\mathbb{e}}_{k} = {{u_{k} - {\hat{u}}_{k}} = {\sum\limits_{i = 0}^{p}\;{A_{i}u_{k - i}}}}},{{{with}\mspace{14mu} A_{0}} = {I = \begin{bmatrix} 1 & 0 \\ 0 & 1 \end{bmatrix}}}} & (40) \end{matrix}$

It should be noted that (40) is similar to the AR (all pole) equation except that the variables are not scalers. Now, following the procedure in deriving the recursive Levinson algorithm for the linear predictor, one can derive its multichannel (two-channel in this case) version. The following analagous variable applicable to the multichannel case are defined:

A_(i) ^(j), i=0, 1, . . . , j, j-th recursion forward predictor coefficients

${e_{k}^{j} = {\sum\limits_{i = 0}^{j}\;{A_{i}^{j}u_{k - i}}}},\mspace{14mu}{A_{0}^{j} = I},$ j-th recursion forward prediction error E_(f) ^(j)=E{e_(k) ^(j)(e_(k) ^(j))^(T)}, j-th recursion mean square forward prediction error B_(i) ^(j), i=0, 1, . . . , j, j-th recursion backward predictor coefficients

${w_{k}^{j} = {\sum\limits_{i = 1}^{j + 1}\;{B_{j - i + 1}^{j}u_{k - i}}}},\mspace{14mu}{B_{0}^{j} = I},$ j-th recursion backward prediction error E_(b) ^(j)=E{w_(k) ^(j)(w_(k) ^(j))^(T)}, j-th recursion mean square backward prediction error

${R(i)} = {{E\left\{ {u_{k}\left( u_{k} \right)}^{T} \right\}} = {{E\left\{ \begin{bmatrix} {y_{k}y_{k - i}} & {y_{k}x_{k + \delta - i}} \\ {x_{k + \delta}y_{k - i}} & {x_{k + \delta}x_{k + \delta - i}} \end{bmatrix} \right\}} = {\begin{bmatrix} {R_{yy}(i)} & {R_{yx}\left( {i - \delta} \right)} \\ {R_{xy}\left( {i + \delta} \right)} & {R_{xx}(i)} \end{bmatrix} = {R^{T}\left( {- i} \right)}}}}$

The multichannel Levinson algorithm can now be described as follows:

Initial conditions: A₀ ⁰=I B₀ ⁰=I E_(f) ⁰=R(0) E_(b) ⁰=R(0) Algorithm, j+0, 1, . . . , p−1: Using A_(i) ^(j), B_(i) ^(j) for i=0, 1, . . . , j, and E_(f) ^(j), E_(b) ^(j) calculate:

$\begin{matrix} \begin{matrix} {{\alpha^{j} = {\sum\limits_{i = 0}^{j}\;{A_{i}^{j}{R\left( {j - i + 1} \right)}}}},{A_{0}^{j} = I}} \\ {{\beta^{j} = {\sum\limits_{i = 1}^{j + 1}\;{B_{j - i + 1}^{j}{R\left( {- i} \right)}}}},{B_{0}^{j} = I}} \\ {K_{f}^{j} = {- {\alpha^{j}\left( E_{b}^{j} \right)}^{- 1}}} \\ {K_{b}^{j} = {- {\beta^{j}\left( E_{f}^{j} \right)}^{- 1}}} \end{matrix} & (41) \end{matrix}$ Calculate forward and backward prediction coefficients and MS errors: A ₀ ^(j+1) =I A _(i) ^(j+1) =A _(i) ^(j) +K _(f) ^(j) B _(j−i+1) ^(j), i=1, 2, . . . , j A _(j+1) ^(j+1) =K _(f) ^(j) B ₀ ^(j+1) =I B_(i) ^(j+1) =B _(i) ^(j) +K _(b) ^(j) A _(j−i+1) ^(j), i=1, 2, . . . , j B _(j+1) ^(j+1) =K _(b) ^(j) E _(f) ^(j+1)=(I−K _(f) ^(j) K _(b) ^(j))E _(f) ^(j) E _(b) ^(j+1)=(I−K _(b) ^(j) K _(f) ^(j))E _(b) ^(j)

Variables α^(j), β^(j), K_(f) ^(j), and K_(b) ^(j), are only auxiliary variables. The last two variables are also known as the partial correlation coefficients. After p iterations, the model parameters are determined from the forward prediction coefficients (39).

Since the recursive procedures of the multichannel Levinson algorithm are carried out with 2×2 matrix operations, the multichannel Levinson algorithm is computationally efficient, particularly in comparison to the UTCb model. The multichannel Levinson algorithm is also robust with respect to numerical errors, and requires relatively low memory storage. However, use of the multichannel Levinson algorithm to solve the parameters of the TC-AR model implies certain compromises, as discussed throughout the remainder of the specification.

In an embodiment of the present invention, the parameters of the pole polynomial are directly applied to the TEQ, which forms a FIR filter. Specifically, the description of the ARMA model (3) can be rewritten as:

$\begin{matrix} {{\hat{H}(z)} = {\frac{\sum\limits_{k = 0}^{q}\;{b_{k}z^{- k}}}{1 + {\sum\limits_{k = 1}^{p}\;{a_{k}z^{- k}}}} = {\frac{B(z)}{1 + {A(z)}} \approx {H(z)}}}} & (42) \end{matrix}$

If the TEQ, W(z), is the denominator polynomial in (42), that is: W(z)=1+A(z)  (43) then:

$\begin{matrix} {{{H(z)}{W(z)}} = {{{{H(z)}\left\lbrack {1 + {A(z)}} \right\rbrack} \approx {\frac{B(z)}{1 + {A(z)}}\left\lbrack {1 + {A(z)}} \right\rbrack}} = {B(z)}}} & (44) \end{matrix}$

In other words, the equivalent (combined) channel made up of the original channel H(z) and the TEQ, W(z), in cascade, can be approximated by an FIR filter with the taps equal to the coefficients of the numerator polynomial, B(z). Consequently, the length of the combined channel approximately equals q, the number of zeros in the ARMA model. Therefore, if an accurate modeling is achieved with q≦v+1, then ISI and ICI are avoided, and orthogonality between subchannels is achieved.

It should be noted from (43) that the first TEQ coefficient is always one. This provides a non-trivial solution, but offers, for the same TEQ length, fewer degrees of freedom compared to the UTCb model. For example, in the TC-AR model, there is no way to conveniently choose Δ, as there is in the UTCb model. Also, with the first TEQ tap always being unity, the resulting combined impulse response, b_(k), tends to have the bulk of its energy located around the same position as the bulk of the energy for the original channel, h_(k). This is not always optimal, especially when the original channel impulse response is spread. For the purpose of effective modeling, the first Δ₁ samples corresponding to the “flat” part of precursor were not taken into account.

Consider, for example, the upstream and downstream channels in an exemplary G.lite system that is simulated using ANSI loop #7 with G.lite transceiver filters and white background noise at −100 dBm/Hz. Even though the upstream and downstream signals occupy different frequency bands, rather sharp filtering is still required in order to completely separate upstream and downstream signals. Because the transceiver filters are considered to be part of the overall channel, the channel impulse response strongly depends upon their design. Since an upstream channel requires a steeper filter slope than a downstream channel, the total channel impulse response is more spread for the upstream channel than for the downstream channel. Consequently, a longer TEQ is used for the upstream channel than for the downstream channel (e.g., 32 taps for the downstream and 64 taps for the upstream).

When modeled using the reference UTCb model with the heuristically found delay Δ₂, the TEQ provides effective equalization for both the upstream channel and the downstream channel. FIG. 16( a) shows the compressed (shortened) impulse response obtained from the UTCb model on the simulated downstream G.lite channel, with the compressed (shortened) impulse response shown as a solid line and the original channel impulse response shown as a dotted line. FIG. 16( b) shows the compressed (shortened) impulse response obtained from the UTCb model on the simulated upstream G.lite channel, with the compressed (shortened) impulse response shown as a solid line and the original channel impulse response shown as a dotted line.

When modeled using the TC-AR model, the TEQ provides effective equalization for the downstream channel but not for the upstream channel. FIG. 17 (a) shows the compressed (shortened) impulse response obtained from the TC-AR model on the simulated downstream G.lite channel, with the compressed (shortened) impulse response shown as a solid line and the original channel impulse response shown as a dotted line. FIG. 17( b) shows the compressed (shortened) impulse response obtained from the TC-AR model on the simulated upstream G.lite channel, with the compressed (shortened) impulse response shown as a solid line and the original channel impulse response shown as a dotted line.

In order to better measure and evaluate the achieved performance of the TC-AR model, a signal to total distortion ratio (STDR) is calculated. The total distortion is calculated as the sum of the combined ISI/ICI term and the noise terms, as measured at the slicer input. STDR is calculated as a function of the subcarrier order number, and is defined as follows:

$\begin{matrix} {{{STDR}_{k} = \frac{S_{R,k}}{S_{I,k} + N_{k}}},} & (45) \end{matrix}$ where S_(R,k) is the receive signal discrete power spectrum, S_(I,k) is the discrete power spectrum of combined ISI/ICI due to insufficient channel shortening, and N_(k) is the discrete power spectrum of the noise component. Index k represents the subcarrier order number. It is assumed that the samples of a non-ideally shortened channel impulse response do not span more than one DMT symbol interval (N₂ samples). Those samples are denoted by p_(k), and are divided into three groups: h=[h₀, h₁, . . . , h_(H−1)]^(T), head, b=[b₀, b₁, . . . , b_(v)]^(T), body (most powerful v+1 consecutive samples), t=[t₀, t₁, . . . , t_(T−1)]^(T), tail, so that: p=[p ₀ , p ₁ , . . . , p _(N) ₂ ⁻¹]^(T) =[h ₀ , h ₁ , . . . , h _(H−1) , b ₀ , b ₁ , . . . , b _(v) t ₀ , t ₁ , . . . , t _(T−1)]^(T)  (46) with the constraint, H+v+1+T=N₂, as shown in FIG. 18. A rotated version of p, p′, is used as well, and its definition is: p′=[p′ ₀ , p′ ₁ , . . . , p′ _(N) ₂ ⁻¹]^(T) =[b ₀ , b ₁ , . . . , b _(v) t ₀ , t ₁ , . . . , t _(T−1) , h ₀ , h ₁ , . . . , h _(H−1)]^(T)  (47) Then, the components of (45) are computed as follows:

$\begin{matrix} {S_{R,k} = {S_{X}{{\sum\limits_{i = 0}^{N_{2} - 1}\;{p_{i}^{*}{\mathbb{e}}^{{- j}\frac{2\;\pi}{N_{2}}{ik}}}}}^{2}}} \\ \left. {{S_{I,k} = {\frac{2}{N_{2}}{S_{X}\left\lbrack {{\sum\limits_{m = 0}^{T - 1}\;{{\sum\limits_{i = 0}^{T - 1 - m}\;{t_{i + m}{\mathbb{e}}^{{- j}\frac{2\;\pi}{N_{2}}{ik}}}}}^{2}} + \sum\limits_{m = 0}^{H - 1}}\;  \right.}{\sum\limits_{m = 0}^{H - 1 - m}\;{h_{i}{\mathbb{e}}^{{- j}\frac{2\;\pi}{N_{2}}{({N_{2} - H + i + m})}k}}}}}}^{2} \right\rbrack \\ {N_{k} = {\frac{S_{N,k}}{\left( {2L} \right)^{2}}{\sum\limits_{i = {- L}}^{L}\;{{{W\left( {\mathbb{e}}^{j\frac{\pi}{L}i} \right)}}^{2}\frac{\sin^{2}\left\lbrack {\pi\left( {k - {\frac{N_{2}}{2L}i}} \right)} \right\rbrack}{\sin^{2}\left\lbrack {\pi\left( {\frac{k}{N_{2}} - \frac{i}{2L}} \right)} \right\rbrack}}}}} \end{matrix}$ where, S_(x) is the transmit (flat) power spectrum, calculated from the power spectrum density specification provided in the ADSL recommendations (−40 dBm/Hz for the downstream and −38 dBm/Hz for the upstream); S_(N,k)=S_(N) is the discrete noise power spectrum, assumed flat for the purposes of this discussion; W(e^(jω)) is the Fourier transform of the discrete TEQ filter; 2L is the number of segments used for numerical evaluation of the integral, that was originally in the expression for N_(k); N₂=2N, where N is the number of subcarriers in the observed system, as stated before.

FIG. 19( a) shows the STDR obtained on the simulated downstream G.lite channel, with the STDR obtained from the TC-AR model shown as a solid line and the STDR obtained from the UTCb model shown as a dotted line. FIG. 19( b) shows the STDR obtained on the simulated upstream G.lite channel, with the STDR obtained from the TC-AR model shown as a solid line and the STDR obtained from the UTCb model shown as a dotted line.

It is clear to see from FIG. 19( b) that straight application of generalized ARMA modeling does not produce satisfactory results in the upstream case. This may be explained as a consequence of several unfavorable facts. First, the number of zeros in the model is restricted to be less than or equal to the cyclic prefix. In the case of the upstream direction, v=4, whereas in the case of the downstream, v=16 (G.lite) or v=32 (G.dmt). Therefore, the upstream channel modeling is even more restricted, and it cannot be simply compensated by increasing the number of poles. Second, modeling the upstream channel with larger spread suffers even more from lacking the possibility to model delay, Δ, associated with the optimum MSE, as is done in the UTCb model.

Two-Pass Shortening Model for Modeling the Upstream Channel

Therefore, in an embodiment of the present invention, the upstream channel is modeled in two passes, first for the original channel, and then to the mirrored (reversed in time with appropriate delay) image of the result of the first pass. Specifically, the multichannel Levinson algorithm is first applied to the bulk of the original channel (the precursor samples are preferably discarded) with slightly relaxed requirements for shortening, such that the number of zeros in the first pass, q₁, is greater than the CP length (i.e., q₁>v). The multichannel Levinson algorithm is then applied to the mirrored (reversed in time with appropriate delay) image of the result of the first pass. In this second pass, however, the number of zeros, q₂, is not permitted to exceed the CP length (i.e., q₂≦v). The TEQ parameters are obtained by cascading the filter corresponding to the denominator polynomial of the first pass modeling and the filter corresponding to the mirrored image of the denominator polynomial of the second pass modeling. The number of poles in the first and second pass, p₁ and P₂, respectively, are chosen such that p₁+p₂+1=M, where M is the TEQ filter length.

An exemplary embodiment of the invention utilizes the heuristically determined values p₁≈2M/3, q₁₌2v, p₂≈M/3, and q2=v for the two-pass shortening model.

The result of the two-pass shortening can be expressed in the following form: H _(shortened)(z)=H(z)[1+A _(i)(z)][1+A ₂(z ⁻¹)]z ^(−(p) ² ⁺¹⁾  (48) where H(z) is the original channel, [1+A₁(z)] is the denominator polynomial of the first pass modeling, and [1+A₂(z⁻¹)]z^(−(p) ₂+1) is the denominator polynomial of the second pass modeling, reversed in time and delayed. H_(shortened)(z) can be evaluated based on the following expressions:

$\begin{matrix} {{{H\text{(z)}} \approx \frac{B_{1}(z)}{1 + {A_{1}(z)}}},{{first}\mspace{14mu}{pass}}} & (49) \end{matrix}$ H ₁ ^(mirror)(z)=H ₁(z ⁻¹)z ^(−N) ² ^(, H) ₁(z)=H(z)[1+A ₁(z)], input to second pass  (50)

$\begin{matrix} {{{H_{1}^{mirror}(z)} \approx \frac{B_{2}(z)}{1 + {A_{2}(z)}}},{{second}\mspace{14mu}{pass}}} & (51) \end{matrix}$

By combining (49), (50), and (51), expression (48) can be evaluated as: H _(shortened)(z)=B ₂(z ⁻¹)z ^(−N) ² z ^(−(p) ² ⁺¹⁾  (52)

The TEQ coefficients are obtained by combining A₁(z) and A₂(1/z) using the appropriate delay element, as described below.

The two-pass shortening model is demonstrated in FIG. 20. FIG. 20( a) shows the impulse response of the original channel. FIG. 20( b) shows the shortened impulse response after the first pass modeling. FIG. 20( c) shows the mirrored image of the shortened impulse response obtained from the first pass modeling. FIG. 20( d) shows the shortened impulse response after the second pass modeling. FIG. 20( e) shows the final combined channel obtained from the two-pass shortening model.

In order to better measure and evaluate the achieved performance of the two-pass shortening model, a signal to total distortion ratio (STDR) is calculated. The total distortion is calculated as the sum of the combined ISI/ICI term and the noise terms, as measured at the slicer input. FIG. 21 shows the STDR obtained on the simulated upstream G.lite channel, with the STDR obtained from the two-pass shortening model shown as a solid line and the STDR obtained from the UTCb model shown as a dotted line. When compared to the STDR obtained from the single-pass TC-AR model, as shown in FIG. 19( b), it is clear that the performance of the two-pass shortening model is substantially better than the performance of the single-pass TC-AR model for time-domain equalization of the ADSL upstream channel.

It should be noted that the two-pass shortening model introduces delay as a side-effect of modeling the mirrored image in the second pass modeling. Specifically, the flat precursor delay of the final combined channel, Δ_(s), can be approximately determined with reference to FIG. 20( e), with the assumption that the shortened channel impulse response length equals the number of zeros (q) in the applied ARMA model. Δ_(s)≈Δ₁(q ₁ −q ₂)+(p ₂+1)  (53)

Since the number of zeros in the first pass modeling, q₁, is greater than the CP length and the number of zeros in the second pass modeling, q₂, is less than or equal to the CP length, it follows that q₁>q₂, and consequently that Δ_(s)>Δ₁. This has the desirable effect of shifting the shortened impulse response closer to the optimal range, as shown in FIG. 22. While both the UTCb model and the two-pass shortening model obtain satisfactory delay values, it should be noted that the UTCb model selects a value for Δ, whereas two-pass shortening model causes delay by the procedure.

FIG. 23 is a logic flow diagram showing exemplary logic 2300 for determining TEQ coefficients for time-domain equalization of the upstream channel using the two-pass shortening model. Beginning at step 2302, the logic applies the ARMA model with p₁ poles and q₁ zeros to the bulk of the original channel impulse response, in step 2304, in order to form a first shortened channel having a first approximation H₁(z)=B₁(z)/1+A₁(z). The logic then forms a time-mirrored image of the first shortened channel, in step 2306, and applies the ARMA model with p₂ poles and q₂ zeros to the first shortened channel, in step 2308, in order to form a second shortened channel having a second approximation H₂(z)=B₂(z)/1+A₂(z). The logic then combines A₁(z) and A₂(1/z) with appropriate delay to obtain the TEQ coefficients, in step 2310. The logic 2300 terminates in step 2399.

It is worth noting that the concept of modeling the reversed in time precursor of the impulse response was explored in [7] and [8]. Justification there was sought in the fact that the ARMA modeling achieves better results in modeling a decaying signal. While the multichannel Levinson algorithm guarantees a stable solution (all poles inside the unit circle), assuming adequate processing precision, the reversing in time operation (replacement of z with 1/z) puts all poles outside the unit circle. In other words, the ARMA model obtained this way is unstable. Such instability is a problem for modeling the reversed in time precursor of the impulse response, as explored in [7] and [8], because the purpose there was to synthesize the channel. Fortunately, such instability is not a problem for the two-pass shortening model of the present invention, since there is no attempt to synthesize the channel, but rather to determine the denominator polynomial coefficients that are used for setting the TEQ filter, which is always stable.

ALTERNATIVE EMBODIMENTS

In an exemplary embodiment of the present invention, the training logic including the two-pass shortening model is implemented predominantly as programmed logic that is stored in a computer readable medium and executed by a Digital Signal Processor (DSP) or other processor within the ATU. The programmed logic may be implemented in any of a number of programming languages. The programmed logic may be fixed in a tangible medium, such as a computer readable medium (e.g., a diskette, CD-ROM, ROM, or fixed disk), or fixed in a computer data signal embodied in a carrier wave that is transmittable to a computer system via a model or other interface device, such as a communications adapter connected to a network over a medium. The medium may be a tangible medium (e.g., analog, digital, or optical communication lines) or a medium implemented with wireless techniques (e.g., microwave, infrared, or other transmission techniques). The programmed logic embodies all or part of the functionality previously described herein with respect to the system.

Alternative embodiments of the invention may be implemented using discrete components, integrated circuitry, programmable logic used in conjunction with a programmable logic device such as a Field Programmable Gate Array (FPGA) or microprocessor, or any other means including any combination thereof.

The present invention may be embodied in other specific forms without departing from the essence or essential characteristics. The described embodiments are to be considered in all respects only as illustrative and not restrictive. 

1. A channel shortening method for training a time domain equalizer comprising: determining a first shortened channel impulse response for a communication channel using a first channel modeling scheme; forming a time-mirrored image of the first shortened channel impulse response; determining a second shortened channel impulse response for the time-mirrored image of the first shortened channel impulse response using a second channel modeling scheme; combining the first shortened channel impulse response with an inverse of the second shortened channel impulse response to obtain a third shortened channel impulse response; and employing the third shortened channel impulse response to configure a time domain equalizer.
 2. The channel shortening method of claim 1, wherein determining a first shortened channel impulse response for a communication channel using a first channel modeling scheme comprises applying an auto-regressive moving average model using p₁ poles and q₁ zeros to form the first shortened channel impulse response having a first approximation H₁(z)=B₁(z)/1+A₁(z), wherein q₁ is greater than a predetermined cyclic prefix length.
 3. The channel shortening method of claim 1, wherein determining a second shortened channel impulse response for me time-mirrored image of the first shortened channel impulse response using a second channel modeling scheme comprises applying an auto-regressive moving average model using p₂ poles and q₂ zeros to form the second shortened channel impulse response having a second approximation H₁(z)=B₁(z)/1+A₁(z), wherein q₁ is less than or equal to a predetermined cyclic prefix length.
 4. The channel shortening method of claim 1, wherein the first shortened channel impulse response has a first approximation H₁(z)=B₁(z)/1+A₁(z) and the second shortened channel impulse response has a second approximation H₂(z)=B₂(z)/1+A₂(z), and wherein combining the first shortened channel impulse response with an inverse of the second shortened channel impulse response to obtain a third shortened channel impulse response comprises combining A₁(z) and A₂(1/z) with appropriate delay.
 5. The channel shortening method of claim 1, wherein the first channel modeling scheme is based upon a pole-zero model having p₁ poles and the second channel modeling scheme is based upon a pole-zero model having p₂ poles and wherein the number of taps in a channel shortening filter equals (p₁+p₂+1).
 6. The method of claim 1, wherein the communication channel is an Asymmetric Digital Subscriber Line (ADSL) upstream channel.
 7. An apparatus comprising a time-domain equalizer for equalizing a communication channel and training logic for training the time-domain equalizer based upon a training signal received over the communication channel, wherein the training logic is operably coupled to determine a set of coefficients for the time-domain equalizer using a two-pass auto-regressive moving average model.
 8. The apparatus of claim 7, wherein the training logic comprises: first channel modeling logic operably coupled to determine a first shortened channel impulse response for the communication channel; inversion logic operably coupled to form a time-mirrored image of the first shortened channel impulse response; second channel modeling logic operably coupled to determine a second shortened channel impulse response for the time-mirrored image of the first shortened channel impulse response; and coefficient determination logic operably coupled to combine the first shortened channel impulse response with an inverse of the second shortened channel impulse response to obtain a third shortened channel impulse response.
 9. The apparatus of claim 8, wherein the first channel modeling logic is operably coupled to apply an auto-regressive moving average model using p₁ poles and q₁ zeros to form the first shortened channel impulse response having a first approximation H₁(z)=B₁(z)/1+A₁(z), wherein q₁ is greater than a predetermined cyclic prefix length.
 10. The apparatus of claim 8, wherein the second channel modeling logic is operably coupled to apply an auto-regressive moving average model using p₂ poles and q₂ zeros to form the second shortened channel impulse response having a second approximation H₂(z)=B₂(z)/1+A₂(z), wherein q₂ is less than or equal to a predetermined cyclic prefix length.
 11. The apparatus of claim 8, wherein the first shortened channel impulse response has a first approximation H₁(z)=B₁(z)/1+A₁(z) and the second shortened channel impulse response has a second approximation H₂(z)=B₂(z)/1+A₂(z), and wherein the coefficient determination logic is operably coupled to combine A₁(z) and A₂(1/z)) with appropriate delay in order to obtain the third shortened channel impulse response.
 12. The apparatus of claim 8, wherein the first channel modeling logic is based upon a pole-zero model having p₁ poles and the second channel modeling logic is based upon a pole-zero model having p₁ poles, and wherein the number of taps the time-domain equalizer equals (p₁+p₁+1).
 13. The apparatus of claim 7, wherein the communication channel is an Asymmetric Digital Subscriber Line (ADSL) upstream channel, and wherein the apparatus is a central ADSL terminal unit.
 14. A program product including a computer readable medium having stored therein a computer program, the computer program for training a time-domain equalizer based upon a training signal received over a communication channel, the program product comprising: first channel modeling logic programmed to determine a first shortened channel impulse response for the communication channel; inversion logic programmed to form a time-mirrored image of the first shortened channel impulse response; second channel modeling logic programmed to determine a second shortened channel impulse response for the time-mirrored image of the first shortened channel impulse response; coefficient determination logic programmed to combine the first shortened channel impulse response with an inverse of the second shortened channel impulse response to obtain a third shortened channel impulse response; and configuration logic operable to employ the third shortened channel impulse response to configure a time domain equalizer.
 15. The program product of claim 14, wherein the first channel modeling logic is programmed to apply an auto-regressive moving average model using p₁ poles and q₁ zeros to form the first shortened channel impulse response having a first approximation H₁(z)=B₁(z)/1+A₁(z), wherein q₁ is greater than a predetermined cyclic prefix length.
 16. The program product of claim 14, wherein the second channel modeling logic is programmed to apply an auto-regressive moving avenge model using p₂ poles and q₂ zeros to form the second shortened channel impulse response having a second approximation H₂(z)=B₂(z)/1+A₂(z), wherein q₂ is less than or equal to a predetermined cyclic prefix length.
 17. The program product of claim 14, wherein the first shortened channel impulse response has a first approximation H₁(z)=B₁(z)/1+A₁(z) and the second shortened channel impulse response has a second approximation H₂(z)=B₂(z)/1+A₂(z), and wherein the coefficient determination logic is programmed to combine A₁(z) and A₂(1/z) with appropriate delay in order to obtain the third shortened channel impulse response.
 18. The program product of claim 14, wherein the first channel modeling logic is based upon a pole-zero model having p₁ poles and the second channel modeling logic is based upon a pole-zero model having p₁ poles, and wherein the number of taps the time-domain equalizer equals (p₁+p₂+1).
 19. The program product of claim 14, wherein the communication channel is an Asymmetric Digital Subscriber Line (ADSL) upstream channel. 